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1.  INTRODUCTION 


This  document  summarizes  work  performed  by  Delta  Information  Systems, 
Inc.  (DIS)  for  the  National  Communications  System  (NCS),  Office  of  Technology 
and  Standards.  This  office  is  responsible  for  the  management  of  the  Federal 
Telecommunications  Standards  Program,  which  develops  telecommunications 
standards,  whose  use  is  mandatory  for  all  Federal  departments  and  agencies.  The 
purpose  of  this  project,  performed  under  contract  number  DCA100-91-C-0031, 
was  to  scan  the  "eight  CCITT  test  documents"  at  pel  densities  of  200,  300,  400 
and  600  pels  per  inch,  and  store  them  on  DOS  diskettes. 

The  eight  documents  were  originally  digitized  by  the  French  Administration, 
and  were  used  in  the  Group  3  Facsimile  algorithm  selection  process,  completed  in 
1980.  These  images  are  often  referred  to  as  the  "CCITT  standard  images". 
Although  they  were  never  an  official  standard,  they  have  been  used  extensively  by 
experimenters  over  the  years.  They  were  scanned  at  roughly  200  pels  per  inch, 
consistent  with  the  capabilities  of  facsimile  machines  at  that  time. 

Recently  there  has  been  renewed  interest  in  these  images,  including  a 
proposal  in  the  CCITT  that  they  be  made  an  official  standard  (recommendation). 
Many  of  the  inquiries  for  these  images  have  been  for  digitized  images  on  DOS 
diskette  media  at  pel  densities  higher  than  200  pels  per  inch.  Current  Group  3  and 
Group  4  facsimile  machines  have  up  to  400  pel  per  inch  capability,  and  the  next 
step  could  easily  be  600  pels  per  inch. 

The  NCS  has  been  a  leader  in  the  development  and  promulgation  of 
standardized  imagery  for  facsimile.  The  NCS  has  sponsored  the  digitizing  of 
documents  at  resolutions  of  200,  240,  300,  400,  and  480  lines  per  inch.  This 
data  has  been  used  by  many  experimenters  in  the  development  of  standard 
compression  algorithms  for  digital  facsimile,  and  has  contributed  significantly  to 
the  development  of  facsimile  recommendations  which  will  be  of  considerable  value 
to  the  U.S.  Government.  In  addition,  the  NCS  sponsored  the  preparation  of 
standard  gray  scale  images,  representative  of  continuous  tone  pictures  to  be 
transmitted  through  facsimile  systems. 
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The  purpose  of  this  project  was  to  scan  the  "eight  CCITT  test  documents" 
at  pel  densities  of  200,  300,  400  and  600  pels  per  inch,  compress  the  data,  and 
store  it  on  DOS  diskettes. 

This  report  is  comprised  of  three  sections.  Section  1  provides  a  brief 
description  of  the  objectives  of  the  study  and  an  outline  of  the  contents  of  this 
report.  Section  2  discusses  the  steps  taken  to  scan  the  images.  Section  3  is  a 
guide  to  expanding  the  images  to  their  original  size. 
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2.  TECHNICAL  DISCUSSION 


Figure  1  through  Figure  8  illustrate  the  test  documents  that  were  scanned. 
The  images  used  in  the  scanning  process  were  obtained  from  SEPT’.  They  are 
from  a  set  of  original-quality  copies  made  at  the  time  that  the  Group  3  compression 
algorithm  studies  were  being  conducted.  All  of  the  images  are  A4  size,  that  is  210 
mm  wide  by  297  mm  long. 

The  resolutions  selected  for  the  scanning  process  (200,  300,  400,  and  600 
pels  per  25.4  mm)  were  based  on  those  specified  in  the  Group  3  and  Group  4 
recommendations  (200,  300,  and  400  pels  per  25.4  mm)  plus  600  pels  per  25.4 
mm.  The  additional  higher  resolution  anticipates  the  advancing  laser  printing 
technology. 

Having  selected  the  resolutions  and  given  the  width  of  the  image,  the 
number  of  pels  per  line  can  be  determined.  For  the  A4  page  width  of  210  mm,  a 
200  pels  per  25.4  mm  scan  gives  1654  pels  per  line.  The  1654  pel  width  is  not  a 
convenient  number  for  computers  (not  divisible  by  8).  One  way  to  correct  this 
condition  is  to  extend  the  right  margin  to  make  an  even  number  of  bytes,  as  the 
French  PTT  did  when  they  originally  scanned  the  images.  A  better  approach  would 
be  to  produce  a  digital  image  that  closely  approximates  the  image  produced  by  a 
Group  3  or  Group  4  facsimile  scanner.  The  latter  approach  was  chosen  for  this 
project;  that  is,  the  charts  were  scanned  to  produce  the  nominal  pels  per  line  and 
lines  per  image  shown  in  the  Group  3  and  Group  4  Recommendations.  This  means 
that  the  documents  were  centered  and  overscanned.  (The  chart  itself  is  210  mm 
wide,  but  the  scan  line  is  219.46  mm  wide.)  Figure  9  illustrates  the  scanning 
geometry  for  the  four  sampling  densities.  A  reference  point  is  defined  that  is  just 
inside  the  top  left  corner  of  the  image,  0.5  pels  from  the  corner.  The  coordinates 
of  the  reference  point,  shown  in  pels,  define  the  position  of  the  AA-size  image 
within  the  total  digitized  image.  The  blanking  margin  is  white. 

The  images  are  scanned  left-to-right  and  top-to-bottom,  just  as  in  facsimile. 


’  Mr.  Grimault  of  Service  d'Etudes  communes  de  La  Poste  et  de  France  Telecom 
graciously  provided  the  images. 
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The  scanning  process  was  performed  by  the  Image  Electronics  Center  of  Eastman 
Kodak  on  a  microdensitometer.  The  microdensitometer  samples  each  pel  at  1 2 
bits.  The  data  is  processed  to  produce  one  bit  per  pel,  stored  in  the  following  way. 
The  left-most  (first)  pel  of  the  first  scan  line  is  stored  in  the  most  significant  bit  of 
the  first  byte.  Proceeding  from  left  to  right,  pels  are  stored  in  successive  bytes,  8 
pels  to  a  byte,  from  most  significant  to  least  significant.  A  black  pel  is  a  "1"  and  a 
white  pel  is  a  "0". 
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THE  SLEREXE  COMPAxNY  LIMITED 

SAPORS  LANE  -  BOOLE  -  DORSET  -  BH  25  8  ER 
TELEPHONE  BOOLE  (945  13)  51617  •  TELEX  123456 


Our  Ref.  350/PJC/EAC 


18th  January,  1972. 


Dr.  P.N.  Cundall, 
Mining  Surveys  Led., 
Holroyd  Road, 
Reading, 

Berks . 


Dear  Pece, 

Permic  me  to  introduce  you  co  the  facility  of  facsimile 
transmission. 

In  facsimile  a  photocell  is  caused  to  perform  a  raster  scan  over 
the  subject  copy.  The  variations  of  print  density  on  the  document 
cause  the  photocell  to  generate  an  analogous  electrical  video  signal. 
This  signal  is  used  co  modulate  a  carrier,  which  is  transmitted  to  a 
remote  destination  over  a  radio  or  cable  communications  link. 

Ac  Che  remote  terminal,  demodulation  reconstructs  the  video 
signal,  which  is  used  to  modulate  the  density  of  print  produced  by  a 
printing  device.  This  device  is  scanning  in  a  raster  scan  synchronised 
with  that  at  the  transmitting  terminal.  As  a  result,  a  facsimile 
copy  of  the  subject  document  is  produced. 

Probably  you  have  uses  for  this  facility  in  your  organisation. 

Yours  sincerely, 

fM. 

P.J.  CROSS 

Group  Leader  -  Facsimile  Research 
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Pays  1 


TRANSPORTS 

DESTINATION 

Etat  2 


MODE 

Air 


PAYS  D'ORIGINE  PAYS  DE  DESTINATION 
CONDITIONS  OE  LIVRAISON  DATE  74-03-03 
LICENCE  D'EXPORTATION  NATURE  OU  CONTRAT  (monntia) 

fab  ^  , 

CONDITIONS  DE  PAIEMENT  (dcnOanea, 


nomen¬ 

clature 

STATISTICAL 

NO. 

u  123  A 

MASSE  NEHE 
NET  WEIGHT 

MASSE  BRUTE 
GROSS 
WEIGHT 

5  leg 

VALEUR 

value 

DIMENSIONS 
MEASURE- 
.  MENTS 

1400  X 

OUANTITE 

PRIX 

montant 

LIVREE 

UNITAIRE 

TOTAL 

ET  UNITE 

UNIT  PRICE 

TOTAL 

QUANTITY 

amount 

DELIVERED 

AND  UNIT 

2 

104,33  T 

208,66  F 

10 

83,10  T 

831,00  F 

20 

15,00  F 

300,00  F 

D^bours 

incius 

Non  incius 

Embaliages 

Transport 

Assurances 

Moment  total  da  la  factura 

Acomptes 

MAROUES  ET  NUMEROS 
MARKS  AND  NUMBERS 


74.21.456.4^.2  A 


QUANTITE 
COMMANOEE 
ET  UNITE 
QUANTITY 
ORDERED 
AND  UNIT 

2 

10 

25 


N*  ET  REF. 
DE  I'ARTICLE 


AF-809 

S8-T4 

ZI07 


NOMBRE  ET  NATURE  OES  COLIS  : 
denomination  OE  LA  MARCHANOISE 
NUMBER  AND  KING  OF  PACKAGES: 
DESCRIPTION  OF  GOODS 


1  Compoaants 


DESIGNATION 


Circuit  integre 
Connecteur 

Composant  indetermine 


Costs 


Picking 


Freight 


Insurance 


FIGURE  3 


invoice  amount 


Installment 


NET  TO  BE  PAID 


NET  A  HECLER 


1431,80 


L’=r<,r.  d.  Uncmen.  ..  de  rdaU.a.lOd  des  .pplicd.ion,  ta»  I'obi..  da 

Uoc'^Lis  biTau  conlraira  de  procdder  par  «ap.s  par 

paliera  adcc.^. 

re?“.,':^.ia.Ts7mtu  »«:•  . . on.  v„  donner  la  priorild  pour 

leur  realisation.  "ehef  de  oroiet"  responsable  successivement  de  sa 

chef  de  oroiet  doit  des  le  depart  considerer  que  son  activite  a  une  vocation 

chef  ae  proje  n  est  aide  d'une  equipe  d'analystes-programmeurs 

rS2£«  srr  u-.:rr  5 

.Toi  :n^r.pTir  rpr.rdT^^ 

divers  concernes  par  le  projet, et  comporte  obligatoirement  un  bon  analysts  attache  i  ap 
plication. 

II  .  L’IMPLANTATION  GEOGRAPHIQUE  D’UN  RESE.4U  INFORMATIQUE  PERFORaMANT 

L'oreanisation  de  I'entreprise  franqaise  des  telecommunications  repose  sur  1 'existence  de 
^0  regions.  Des  calculateurs  ont  6t6  implantes  dans  le  pass6  au  moins 

importantes.  Ontrouve  ainsi  des  machines  Bull  Gamma  30  4  Lyon  et  Marseille,  des  GE  42 
4  Lille  Bordeaux,  Toulouse  et  Montpellier,  un  GE  437  4  Massy,  emin  quelques  niachines 
Bull  300  TI  4  programmes  cables  etaient  rScemment  ou  sont  encore  en  service  dans  les 
regions  de  Nancy,  Nantes,  Limoges.  Poitiers  et  Rouen  ;  ce  pare  est  essentiellement  utilise 

rrlvii^.^sUa^Slpart^Sl  fidiiers  nScessaires  aux  applications  decrites  plus  haut  peuvent 
etregeresen temps  differ e.  un  certain  nombre  d'entre  eux  devront  n6cessairement  6tre  ac- 
cessibles  voire  mis  4  jour  en  temps  reel  :  parmi  ces  derniers  le  fichier  commercial  des 
abonnes.  ’le  fichier  des  renseignements,  le  fichier  des  circuits,  le  fichier  technique  des 
abonnes  contiendront  des  quantites  considerables  d’informations. 

Le  volume  total  de  caractfer’es  4  gerer  en  phase  finale  sur  un  ordinateur  ayant  en  charge 
quelques  500  000  abonnes  a  ete  estime  4  un  milliard  de  caracteres  au  moms.  Au  moms  le 
tiers  des  donnees  seront  concernees  par  des  traitements  en  temps  reel. 

Aucun  des  calculateurs  enumeres  plus  haut  ne  permettait  d'envisager  de  tels  traitements. 

L'integration  progressive  detoutesies  applications  suppose  la  creation  d'un  support  commun 

pour  tSites  lls  informations,  une  veritable  "Banque  de  donnees  .  repartie  sur  des  moyens 
detraitement  nationaux  et  r6gionaux.  et  qui  devra  rester  alimentee,  mise  a  jour  en  perma. 
nence.  4  partir  de  la  base  de  I'entreprise,  e'est-a-dire  les  chantiers.  les  magasins,  les 
cuichets  des  services  d'abonnement.  les  services  de  personnel  etc. 

L'etude  des  diff6rents  fichiers  4  constituer  a  done  permis  de  definir  les  principales  carac- 
teristioues  du  r6seau  d'ordinateurs  nouveau.x  a  mettre  en  place  pour  aborder  la  realisation 
dusvst6meinformatif.  L'obligation  de  faire  appel  ii  des  ordinateurs  de  troisieme  generation. 
trespuissantsetdot4sdevolumineuses  m6moires  de  masse,  a  conduit  a  en  reduire  substan- 
tieilement  le  nombre. 

L' implantation  de  sept  centres  de  calcul  interrOgionaux  constituera  un  compromis  entre  : 
d'une  partle  d6sir  de  reduire  le  coOt  economique  de  I'ensemble,  de  faciliter  la  coordmation 
des  Equities  d'informaticiens;  et  d'autre  part  le  refus  de  creer  des  centres  trop  importants 
difficUes  4  gerer  e:  a  diriger.et  posani  des  problemes  delicats  de  securitb.  Le  regroupe- 
ment  des  traitements  relatifs  a  plusieurs  regions  sur  chacun  de  ces  sept  centres  permettra 
de  leur  donner  une  taiile  relativernent  homogene.  Chaque  centre  'u-  rera  '  env.r-in  un  mil¬ 
lion  d'abonnes  a  la  :m  au  VIeme  Plan. 

La  mise  en  place  de  ces  centres  a  lebute  au  lebut  de  I'anjire  ;  .  'in  oruinateur  ;Iub  >0  de 

la  Compagnie  Internationale  pour  I'lnformatinue  a  etc  -.nstaih-  T  lulouse  en  idvr-.er  ,  la 
mfime  machine  vient  d'etre  mise  en  service  au  centre  de  caicul  .nterregionai  ae  Bordeaux. 


-  FIGURE  4 

Photo  n®  1  -  Documeru  tr'rs  dense  lettre  L,5mm  de  haut 

Restitution  photo  n®  9 


Cela  est  d’autant  plus  valable  que  T  A/  cst  plus 
grand.  A  cet  igard  la  figure  2  represente  la  vraie  courbe 
donnant  \^(J)\  en  fonction  de/pour  les  valeurs  nunii- 
riques  indiquees  page  pr^dente. 


Fio.  2 


Dans  ce  cas.  le  filtre  adapte  pourra  etre  constitue, 
conformement  a  la  figure  3,  par  la  cascade  ; 

—  d’un  filtre  passe-bande  de  transfer:  unite  pour 
/o  ^  ^ /o*-^ transfer:  quasi  nul  pour 

/  <  /o  et  /  >  /o  -  A  /;  filtre  ne  modifiant  pas  la  phase 
dcs  coTTiposanis  le  iraversant  ; 


Fjc.  3 


—  filtre  suivi  d’une  ligne  a  retard  (LAR)  disper¬ 
sive  ayani  un  temps  de  propagation  de  groupc  7/, 
dccroissani  lincairemeni  avec  la  frequence  /  suivant 
I’expression  : 

r*  =  To+(/o-/)  ^  (avec  To  >  T) 


(voir  fig.  4). 


Fjc.  4 


telle  ligne  a  retard  est  donnee  par  : 


Et  cette  phase  est  bien  I’oppose  de 
a  un  dephasage  constant  pres  (sans  importance) 
et  a  un  retard  To  pres  (inevitable). 

Un  signal  utile  S(t)  traversant  un  tel  filtre  adaptd 
donne  a  la  sortie  (a  un  retard  To  pres  et  a  un  depha¬ 
sage  pres  de  la  porteuse)  un  signal  dont  la  transformee 
de  Fourier  est  reelle,  constante  entre  /o  et  /o  +  A/, 
et  nulle  de  part  et  d’autre  de /o  et  de /o  +  A/,  c_]^esi- 
a-dire  un  signal  de  frequence  porteuse /o  +  A//2  et 
dont  I’enveloppe  a  la  forme  indiquee  a  la  "Sure  5, 
oil  I’on  a  represente  simultanement  le  signal  5(r) 
et  le  signal  S,(/)  correspondant  obtenu  a  la  sortie 
du  filtre  adapte.  On  comprend  le  nom  de  recepteur 
a  compression  d’impulsion  donne  a  ce  genre  de 
filtre  adapte  :  la  «  largeur  »  (A  3  dB)  du  signal  corn- 
prime  etant  egale  a  1/Af,  le  rapport  de  compression 

est  de  -I-  =  TA/ 

1/A/ 


Fio.  5 


On  saisit  physiquement  le  phenomene  de  com¬ 
pression  en  realisant  que  lorsque  le  signal  S(t)  entre 
dans  la  ligne  i  retard  (LAR)  la  frequence  qui  entre 
la  premiere  a  I’instant  0  est  la  frequence  basse  /q, 
qui  met  un  temps  T,,  pour  traverser.  La  frequence  / 

entre  a  I’instant  t  =  ( /  — /«)  —  temps 

A/ 

~  \  .ZL  pour  traverser,  ce  qui  la  fait  ressortir 


A/ 

FIGURE  5  effaletncnt  Ainsi  done,  le  snrnal  S(t^ 
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FIGURE  7 
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memorandum 


NOTES: 

R  =  Sampling  density  (pels  per  25.4  mm) 

P  =  Total  scan  line  length  (^Is)  =  219.46  mm 
W  =  A4  paper  width  (pels)  =210  mm 
B  =  Blanking  margin  (pels) 

L  =  Nominal  number  of  scan  lines 
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Figure  9.  Scanning  geometry 
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3.  IMAGE  STORAGE  AND  RETRIEVAL 


The  total  number  of  bytes  required  to  store  each  of  the  images,  as  a 
function  of  resolution  (or  sampling  density),  is  shown  in  Table  1 .  Note  that  at  200 
pels  per  25.4  mm  about  0.5  megabytes  are  required,  while  at  600  pels  per  25.4 
mm  about  4.5  megabytes  are  needed.  In  order  to  reduce  the  storage  requirements 
and  make  DOS  diskettes  a  practical  storage  media,  the  data  must  be  compressed. 

Table  2  is  a  list  of  the  32  compressed  files  produced  by  scanning  8 
documents  at  4  resolutions.  The  files  are  stored  on  8  DOS  diskettes,  one 
document  per  diskette,  with  a  decompression  program  stored  on  each  of  the 
diskettes.  To  retrieve  an  image  from  a  compressed  file,  first  copy  the  compressed 
file  to  a  hard  disk  and  type: 

pkunzip  < filename  >  [enter] 

The  compressed  file  will  be  expanded  and  the  result  written  to  the  hard  disk. 

Make  sure  that  the  hard  disk  has  enough  space  to  contain  the  expanded  image. 
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Table  1 .  Image  storage  requirements 


pel  density  pels  per  line  scan  lines 
(pels  per  2S.4mm) 


pels  per 
page 


bytes  per 
page 


200 

1728 

2339 

4041792 

505224 

300 

2592 

3508 

9092736 

1136592 

400 

3456 

4677 

16163712 

2020464 

600 

5184 

7016 

36370944 

4546368 
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Table  2.  Compressed  files 


DOCUMENT 

PEL  DENSITY 

FILE  NAME 

UNCOMPRESSED 

COMPRESSED 

BYTES 

BYTES 

Document  #1 

200 

DOC1_200.2IP 

505224 

33981 

300 

DOC1_300.ZIP 

1136592 

61770 

400 

DOC1_400.ZIP 

2020464 

96736 

600 

DOC1_600.ZIP 

4546368 

193956 

Document  #2 

200 

DOC2_200.ZIP 

505224 

31166 

300 

DOC2_300.ZIP 

1136592 

53431 

400 

DOC2_400.ZIP 

2020464 

82740 

600 

DOC2_600.ZIP 

4546368 

150728 

Document  #3 

200 

DOC3_200.ZIP 

505224 

48828 

300 

DOC3_300.ZIP 

1136592 

87117 

400 

DOC3_400.ZIP 

2020464 

134523 

600 

DOC3_600.ZIP 

4546368 

254890 

Document  #4 

200 

DOC4_200.ZIP 

505224 

105163 

300 

DOC4_300.ZIP 

1136592 

186133 

400 

DOC4_400.ZIP 

2020464 

283570 

600 

DOC4_600.ZIP 

4546368 

541148 

Document  #5 

200 

DOC5_200.ZIP 

505224 

57283 

300 

DOC5_300.ZIP 

1136592 

100292 

400 

DOC5_400.ZIP 

2020464 

153352 

600 

DOC5_600.ZIP 

4546368 

288427 

Document  #6 

200 

DOC6_200.ZIP 

505224 

35213 

300 

DOC6_300.ZIP 

1136592 

62364 

400 

DOC6_400.ZIP 

2020464 

99982 

600 

DOC6_600.ZIP 

4546368 

197443 

Document  #7 

200 

DOC7_200.ZIP 

505224 

119462 

300 

DOC7_300.ZIP 

1136592 

209547 

400 

DOC7_400.ZIP 

2020464 

316710 

600 

DOC7_600.ZIP 

4546368 

242113 

Document  #8 

200 

DOC8_200.ZIP 

505224 

47752 

300 

DOC8_300.ZIP 

1136592 

83438 

400 

DOC8_400.ZIP 

2020464 

126685 

600 

DOC8_600.ZIP 

4546368 

234532 
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